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ABSTBACT ' ' ' . 

This paper contains a critical eraluation of the 
res^rch conducted by O.L, Daris and Carl Personke on the use of 
reading readiness tests in English and Spanish for Spanish speaking 
eleaentary school pupils. DaTis and Personke indicaiecT/that, when 
Spanish speaking first graders vere tested in both Eiiglish and 
Spanis^i, lost of the differences were notj significant. The authors of 
this critique argue that Davis and personke aisinterprSted their data 
a;jd, tha:t the result has been a continued aisuse of standardized tests 
for Spanish speaking pupils, one of the objections vas that 
insufficient attention wjis given to language doainance factors and to 
the identification of pupil characteristics other than language. 
Another problea vas that the nature of the populatioii distributions 
contributing to the correlations between predictor and criterion 
aeiisures had not been carefully analyzed. (HKH) • 
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STANDARDIZED TESTING AND THE "SPANISH- SPEAKING MINORITIES : 

LOOKING BEYOND THE, CORRELATIONS ' 

In the decade since the inauguration of. the U. S. Offic^of 

, Education First Grade Reading Studies in 1964 ( The Reading TeJb ier. 

Hay-October, 1966)' questions have been'raised concerning the assess- ' 

^ent of minority group children '(Oakland, 1973) , special education 

placement (Ross, Toung and Cghen, 1971), and the use of available" 

reading readiness and achievement tests with children who come 

from a ^Spanish- speaking background (riorn*^ 1966)\ More recently, 

succinct s,tatements have been made concenning the implications for 

assessment of langyagf©; cbaractei^iatics for ^fiiack children (Bartell, 

* ' • • I ^ ^ . 

Grill, and Bryen, 1973) and^Mexican-American phildren (Matluc^ and 

Macfe, 1973)/ In 'addi^rion, reviews of major thebries'bf language 
development and tWeir implications "for reading instruction suggest 
that refiidfing research and n^thodologies are on the threshold of %^ 
major changes (DiVesta and Palermo, 1974). . 

Qespite an increasied -.tempo of interest •in tjie measurement of 
reading p^formanQe (Blanton, Farr and Tuinmati, .197i|) and the 
evaluatic^n of other lan^age arts (Fagani, Cooper and Jensen, 1975)., 
the v^ray tWt tests are bping used in our schools continues to gen- 
erate crilticism (Goodman, 1975), particularly iij urban^schooTs 
(MacGinitie, 1973). Of prime concern are test validity .(Allen, 
197»+), test fairness (Flaugh^,^ 1974) , 'tests and the disadvantaged 
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<Fitzgibbon^ II71+), and the use of t.ests in Ehglish with Spanish- * 
•speaking elementary school pupils (Davis and/ Personke , 1968; Personke 

3^and^Dav^ ► j 

.The remainder of this article is a critique o^ the Personke 

^ and Davis studies atid of the iks interpretation of their findings 

resulting in, the continuing misuse of standardized tests with 

Spanish-speaking school pupils.. 

Statement* #1 (Personke and 
^ Davis > 1969! DP. 7qufl?r; " 

• ''When Spanish-speaking f irst-graiiers ip four -classrooms 

we^e tested iii, both English andl Spanish, most of the differences 

were not signifiqant. These findings suggest that the English 

administration of the Metropolitan Readiness Tests probably 

did not result in inadequate assessment of, or testing l^as 

•aga;nst, Spanish-speaking* children, at least as far as the 
« 

language parameter was concerned." ^ 
Responses to Sta tement #1 

(1) Without any oral language assessment to determine the 
language dominance of the subjects, it. is not known to what extent 
the language. factors contributed to the scores, ive., monolingual 
English subjects '.scores on the Spanish .administration and mono- 
lingual Spanislrtujjjects' scores oh the English administration of 
the Metropolitan Readiness Test (MRT); 

(2) the MRT does not sample for established language 
dominance in the standardization population; ^ 

(3) since no description of instructional practices in 
reading is provided, the question of the predictive validity of 
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Testing 6 the minorities - 3 

the English and Spanish administrations of the MRT must regain 
unanswered; the fact that the MRT scores from the Spanish admini- 
stration wer^ not correlated with a Spanish version of the ' 
Metropolitan Achievement Test (MAT), other than none was avail- 
able, possibly reflects an instructional situation where beginning 
reading instruction Was in English only; 

(*+) the issue of-whether or not translated tests (para- 
phrased is probably a more accurate term) are appropriate for 
native speakers ofitSpanish has not been resolvecl} 

(5) a different kind of statement is proposed by this author 
as to what the findings suggest and gives quite^ another perspec- 
tive: The English administration of the Metropolitan Readiness 
Tests probably did not result in any more inadequate assessment 
of, or testing bias against, Spanish-speaking children than did 
the Spanish administration of the MRT;;. that is, one was as 
inadequate as th^ pthe:^j, , 

(6) any statement to the effect that any test in a language 
other than that of the child being tested is not biased is, on 
the face of it, absurd; ,/ 

(7) Table 1 is ba^ed upon Phase 1 of the Personke and Davis 
study and underscore^ the reasons why researchers must look 
beyond computed correlations. In the case of Group A, these 
children were judged deficient in the English language and placed 
in pre-first or "readiness classes." The thirty five children 
assigned to regular first grade classes (Group B) were apparently 
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TABLE 1 

Davls'and Personke, J Ed Meas . f^all, I968, p. 23J 



MRT 

Group A (N«53) 
"Pre-First" Sections 



Engl ish 
Mean 
SO 

%Me 

Spanish 
Mean 
SO . 
nile 



39.93 
13.65 
23 



33.98 
12.82 
(15) 

.76 



MRT . 
Group B 1;n-35) 
Regular First Gr. Sections 



if. Engl ish 
Mean 
SD 

XWe. 



Spanish 
■^an 
SD^ 
**Ile 



Total Group (N-88) 



'•9.86 


/ English 


/ Mean 


MM 


/■ SD 




/ *lle 






'55.10 / 


Spani sh 


Mean 


12.50 / 


SO 


(51) / 


mie 


•73 / 





'»i».67 
31 



44.06 
16:48 
(29)^; 



^parentheses Indicate *Me based on English-administration distribution. MRT 
Manual of Directions' , p. 11. . ' 

f \ . , . 

^difference favoring Group B English mean over Group/A English mean significant 
lt-3.44; p^^Ol). Group A vs. Group, B mean diffpre^ce for Spanish version 
apparently not testedfor significance, but Is ab/ut twice that for the English 
versloij. j / 
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judged r^ady to b,egin regular reading instruction. Examination • 
of the data strorigly suggests that Group A and Group B represent 
two different pupil populations as supported by the following: 

(a) The difference favoring the Group B English mean 
ov0r the Group A English mean was statistically significant 
(t/=3.*+*+; probability less than .01); the Group A mean of 39.93 
represents the^ 23rd percentile while the Group B mean of t+9.86 
Represents th^ i+Otb percentile; 

(b) ; although no t-test was apparently computed for 
the difference favoring the Group B Spanish mean (55.10) over 
the (^oup A Spanish mean (33.98), this difference is over twice • 
that for the English version. In addition, on the possibly ques^ 
tionable assumption that the percentiles based on. -the English 
administration distribution might be, comparable to the Spanish 
administration, the Group A mean would fall at the 15th percen- 
tile while the Group B mean, would fall at the 51st percentile. 

In any ckse, pupils 'assigned to Group B by virtue of being judged* 
competent in English also scored much higher than ^roup^ A on 
the Spanish ver^iqn. It is an interesting question as to why 
the Group A pupils scored lower on the Spanish version. than they 
did on the English while the reverse is true for the Group B 
students. 

(c) When Group A and^B are combined the nature of 
the distributions li^obscured so that the English (44.67) and 
Spanish (44.06) means are almost identical; thd correlation 
coefficient, thdugh s^iewhat lower (r=.63), ds still comparable 
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to the correlations between the English and Spanish versions foy 
Group A (r=.76) and Group B (r=.73). 

I 

Stat^ent # 2 (Personke and 

Davis > 1969: p, 8l ) , * . 

"The number of subjects reported here is half of the number 
reported in Phase I. the loss was not accidental. The 
two 'readiness' classed, had been introduced to reading so 
late in the year that it was impossible to administer a ' 
reading test at the end of the year . . . These children 
had very low scores o'n the Metropolitan Readinegs Tests and 
did not learn to read." 



Responses to Statement # 2 

(1) In the words of Personke and Davis (1969, p. 8i+), 
"It is not enough to note that a test is a valid predictor of 
success in reading if the prediction for a large group qf 

children is failure"; this suggests the need to look beyond the, 

^ ■■ . 

" correlations ; 

0 

(2) in Horn's ,1966 study, any student who successfully 
guessed the correct answer on three of the sixteen MRT Word 
Meaning items fell at the project mean (2.99; SD=2..01; National 
Mean = 8.67; SD-8.67); 

(3) algo in Horn's 1966 study, when 98% of the project . . 
sample fell in the two lowest MRT categories (D, low normal;., 
and E, low) for the Word Meaning subtest, there is little 
discrimination between levels of ability; this suggests ques^ 
tionable tes,t validity for this population; similarly 
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questionable test validity appears to exist* for the Personke, 
and Davis populatioti; ^ 

any meaningful pupil perception of testing task 
was unlikely iii Horn's 1966 sample when about 28% (l6i+ of 

584) scdred 0 or 1 on the«Word Meaning subtest; similar i^ifor- 
. ^ , 

nation is not available for the Personke and Davis studyi 

* f 

Statement # 3 (Personke and 

Davis, 1969: p. 82^ ^ 

"The Subtests of Word MUning and Listening,, of the 
trogglitan Readiness Tests, were not highly r^ated to 
reading success." , ^ ) 

Response to Statement # 3 ... 

<1) Rather than use the euphemism "not highly related" * 
with reference to the Subtests of Word Meaning and Listening 
in both the Spanish and English administrations, the r's shown 
in Table 2 below w^r^nt the following* 

(a) There was\o relationship between the MKT Word 
Meaning and -MAT Word Knowledge, Word Discrimination- and Reading 
on the English and Spanish administrations respectively; 

(b) there was .no_ relationship between the MRT 
Listening subtest, and MAT Word Knowledge, Word Discriminaticsn 
and Reading on the Spanish administration; 

(c) the relationship between the MRT Listening subtest 
for the English administration was low re: the MAT Word Knowledge 
and Word Discriminatiba; the correlation of .33 with tfie MAT 
Reading subtest is significant at the .OS-" level of cefnpetei^;' 



Testing e thia mlnoijitlef 6^ 



TABLE 2 



MRT Subtest,!, 
Woyd Meaning 



MAT Word Knowledge • 
MAT Word Discrimination 
MAT Reading 



English 
-.01 



:o4 

-.03 



*Sigiiificant at/ the .05 level 

/ 



Spanish 
.00 

.03 

.16 ■ 



MRT Subtest 2, ' 
Llattnlni 

English Spanish 
.28 . .02 



.18 
.33* 



.05 
.08 



4- ■ 
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(d) the foregoing instances of rio relationship very 

likely reflect a mixture of Spanish-dominant and. English-dominant 

subjects in^the sample j language dominance should be established 

prior to instruction, 

statement f 4 (Personke and 
Davis, 1969 > pp. 82-83) • 

"These findings indicate that the Metropoldtan Readiness Tests 
are useful in predicting certain reading-related ' achievements 
for Spanish-speaking first-grade pupils. Administration of the 
Metropolitan Readiness Tests in Standard English and coll^uial 
Spanish seen^^to yield similar predictions. Consequently, admini- 
\ stration of the test in English rather than in the pupils' native 
Spanish apparently does not result in test bias. The evidence 
does. not support mJles to question the general prd^fice of 
administering tests in English to Spanish-speaking school entrants 

Responses to^Sfeatement § 4 . 

/ ^ " A 

(1) Table 3 pulls together in tabular* form the narrative pre- 
sentation of data found on p. 82 §f Personke^d Davis and compels 
the followirig observations concerning how useful the MRT is in pre- 
\ dieting "certain reading -related achievements tor Spanish-speaking 
first-grade /pupils": * 

(a) Relationships between the total t MRT score and MAT 
Word K^wledge were significant at the .01 level of con^^^c^for 
both tl^e English/admihistration (r= .87) and the Spanisfi MRT admini- 
stration\(r=.^); in fact, all correlations are significant at either 
the .05 orWOl level of competence as reported by Personke and Davis' 
ajid pres^nte^ iii Table 3; , ' 



TABLE 3 



PERSONKE-DAVIS SIGNIFICANT MRT PREDICTORS OF MAT PERFORMANCE FAVORINft^: 



English MRT Administration 



^Spanish MRT Administration 



Total V(kj Score - 
MAT word 
, knowledge 

Alptjabet subtest 
"Vord knowledge 

Alphabet subtest - 
, word discrimina- 
tion i 

Alphabet subtest - 
reading 



t=6.'f5; p<.01 
##(r's«.87t; .6lt) 



t^'l.OO; p<.OI 

(r's«.78t; .38*) 

t«3.13; p<.01 

(r«s=.77t; .'♦St) 



t=3.22; p<.01 
(r's=.65t^ .32*) 



tSignlf ic^nt at the .01 level 
^Significant at the .05 level 



Copying subtest - 
ward knowl edge 

Copying subtest - 
word discrimiha- 
tion 

Copying subtest - 
. reading 



t^2.3k; p<.05 

t=2.j8; p<.05 
(r's=.69t; .47t) 



t=7.3'»; p<.OI 
" (r's=*.60t; .37*)' 



^Identified (Personke and Davis, p. 82) 
as "colloquial Spanish." 

,##Correlation, given first is for English 
administration in column -one; the Spanish 
administration in .column two. 
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(b) the same. number of subtests, three each, favor ^he , 
English and Spanish administration of the MRT; . , 

(c) foi^ the, correlation that ordinarily woui;fi be considered 
t^e mo3t significant,, i.e., betve|n the total *1RT sfeo^e and the MAT 
Reading, none was significant for either the English br Spanish 
administration; * 

. - ' ' ' ■ : ■ - . /. 

(2) an iirahediate question to be raised concerning the usefulness 
^ of MKTs for prediction purposes has to do with the economic and human 
defensibility of subjecting linguistically ^distihct qhildren to a 
testing task in- which. approximately half failed (Persdnke and Davis, 
1969: p, 81); although no coirapent is made concerning the psychological 
impact of failure on young children, a statement is made earlier 
concerning the impact on correlation computatj.ons ; "it is doubtful 
that the incision of data on the reading achievement of the readi- 
ness classes would have effected the coef|j^entS' of correlation \ 
in any but a positive way, since these children had very low ^ 
scores on the Metropolitan Readiness Tests and did^not learn tp 
read" (Personke and Davis, 1969:^ p. .BD^the time, expense and 
Negative psychological ^impact of such testing is open to serious 
question; • 

* 

.(3) the statement to the effect that the admin istr at iojn of, 
tests in English rather than in the pupils' native Spanish did not 
result in test bias, as mmtioned earlier, represents a prima 
facie absurdity; test bias, results when an^ child i^ tested in a 
language other than the one^ which is dominant for him; 

(4) the use of any currently published standardized readiness 
tests\with Spanish-dominant schbol beginners is highly .questionable. 
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especially in the southwest; in addition to cultural biases in. test 
items (eig., "mark ther^ toboggan" verses "mark the tamale ."); tests 
^^uch as the MRT fail to include proportional numbers ofu linguistically 
different -(or distinct , if you prefer) pupils in the standardization 
sample; even with the inclusion of a limited number of so-^ied 
Spanish-surname pupils, this is meaningless without oral language 
assessment to determine language dominance; 

(5) there would be few objections to administering tests in 
English to Spanish-speakinjg school entrants if, in fact, such 
piipils.are true bilinguals, i.e., as at home with Englis^i as with 
Spanish; for whatever- reason, the number -'of Spanish-^surname school 
beginjiers who are monolingual in English (or possibly more- to the 
point, those who decline- to acknowledge Spanisl^) is noticeably 
higher than just a few years ago. > ^ 

Conclusions . ^ ^.^^ 

. ' , * • 

When applying the results oi reading research to the classroom, 
a critic«*^^p2view of the research and research methodology is basic; 
tin this case: 

(1) lack of attention to language dominance factors and the 
identification o^differing pupil characteristics othet* than language 
constitute significant weaknesses an th^ study; ^ 

(2) . inj^teaJ of:;^geptinj|^^'at^^fe v^lue the validity of statis- 
tically significant^ correlations between predictor and cr iter 
measures, the nature of the j^ulation distributions contributing 

to those correlations needP to be carefully analyzed, particularly 
when distributions appear to be bi-=tnodal oi^ nearly so; and 

(3) unless the foregoing are made components of the l?esearch 
design and data analyi^a^s, the result is a continuing inappropriate 
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use of MKTs with Spanish-dominant school beginners. 
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